A Statistical Theory of Dependency Syntax
نویسنده
چکیده
A generative statistical model of dependency syntax is proposed based on Tesni ere's classical theory. It provides a stochastic formalization of the original model of syntactic structure and augments it with a model of the string realization process, the latter which is lacking in Tesni ere's original work. The resulting theory models crossing dependency links, discontinuous nuclei and string merging, and it has been given an e cient computational rendering.
منابع مشابه
A General Probabilistic Model for Dependency Parsing
We address the question what it takes to define a correct probabilistic model for syntactic natural language processing. We focus on one particular theory of syntax, called dependency syntax, and develop a framework for developing probabilistic model for that linguistic theory. Subsequently, we review existing models of probabilistic dependency syntax and show some problematic aspects of these ...
متن کاملAn annotation scheme for Persian based on Autonomous Phrases Theory and Universal Dependencies
A treebank is a corpus with linguistic annotations above the level of the parts of speech. During the first half of the present decade, three treebanks have been developed for Persian either originally or subsequently based on dependency grammar: Persian Treebank (PerTreeBank), Persian Syntactic Dependency Treebank, and Uppsala Persian Dependency Treebank (UPDT). The syntactic analysis of a sen...
متن کاملDiscontinuous Statistical Machine Translation with Target-Side Dependency Syntax
For several languages only potentially non-projective dependency parses are readily available. Projectivizing the parses and utilizing them in syntax-based translation systems often yields particularly bad translation results indicating that those translation models cannot properly utilize such information. We demonstrate that our system based on multi bottom-up tree transducers, which can nati...
متن کاملA Chinese Dependency Syntax for Treebanking
This paper presents a Chinese dependency syntax for treebanking. The syntax contains 13 word classes and 34 dependency types. A format of treebank based on the syntax is also proposed for the applications of computational and general linguistic research. Some experiments show that the treebank based on the proposed dependency syntax can be used for training and evaluating the dependency parser ...
متن کاملA Dependency Edge-based Transfer Model for Statistical Machine Translation
Previous models in syntax-based statistical machine translation usually resort to some kinds of synchronous procedures, few of these works are based on the analysis-transfer-generation methodology. In this paper, we present a statistical implementation of the analysis-transfergeneration methodology in rule-based translation. The procedures of syntax analysis, syntax transfer and language genera...
متن کامل